Acoustic cues identifying phonetic transitions for speech segmentation

Author

  • D. R. van Niekerk
Abstract

The quality of corpus-based text-to-speech (TTS) systems depends strongly on the consistency of boundary placements during phonetic alignment. Expert human transcribers use visually represented acoustic cues to place boundaries consistently at phonetic transitions according to a set of conventions. We present some features commonly (and informally) used as an aid when performing manual segmentation and investigate the feasibility of automatically extracting and utilising these features to identify phonetic transitions. We show that a number of features can be used to reliably detect various classes of phonetic transitions.

Similar resources

Acoustic-phonetic Cues to Word Boundary Location: Evidence from Word Spotting

This research examined acoustic-phonetic cues to word boundary location in French consonant clusters, and assessed their use in on-line lexical segmentation. Two word-spotting experiments manipulated the alignment between word targets and syllable boundaries. A perceptual cost of such misalignment was observed for obstruent-liquid clusters but not for /s/ + obstruent clusters. For the former cl...

Qualitative Evaluation and Error Analysis of Phonetic Segmentation

Speech segmentation is the process of splitting and identifying the boundaries between different units of speech, i.e., words, syllables, and phones. This paper focuses on the automatic phonetic segmentation of speech and the methods used for its evaluation. We explain the current methods used for the evaluation of speech segmentation and highlight the details that have not been sufficiently ad...

A neural network speech recognizer based on the both acoustic steady portions and transitions

Previous works on speech recognition utilizing neural networks have often relied on either recognition through segmentation or mapping of the representation trajectories to the phoneme space. Here, information could be missed due to the manner of border labeling techniques. Recent works have indicated that firstly, phonetic borders and transitions would have a good potential to be recognized as...

A hybrid approach to automatic segmentation and labeling for Mandarin Chinese speech corpus

In this paper, we propose a hybrid approach to refine the phonetic boundaries in a Mandarin speech corpus. This approach employs different sets of acoustic features for different categories of phonetic transitions, except for the most difficult case of “periodic voiced + periodic voiced”, which is therefore handled by a heuristic scheme. Several experiments are designed to demonstrate the feasi...

Speech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers

In spite of decades of research, Automatic Speech Recognition (ASR) is far from reaching the goal of performance close to Human Speech Recognition (HSR). One of the reasons for unsatisfactory performance of the state-of-the-art ASR systems, that are based largely on Hidden Markov Models (HMMs), is the inferior acoustic modeling of low level or phonetic level linguistic information in the speech...

Publication date: 2008